CDS

Accession Number TCMCG021C11019
gbkey CDS
Protein Id XP_010919474.1
Location join(43226509..43226604,43227052..43227166,43227257..43227336,43232153..43232228,43232325..43232389,43232496..43232558,43234511..43234592,43234691..43234853,43240226..43240300,43240462..43241209)
Gene LOC105043591
GeneID 105043591
Organism Elaeis guineensis
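
The Location field above lists the ten coding segments of this CDS as 1-based, inclusive coordinate pairs. The following is a minimal sketch, not part of the original record, of how the spliced CDS length could be checked against the Length field of the Protein section. The helper names `parse_join` and `spliced_length` are hypothetical, and the parser assumes a plain `join(...)` with no `complement(...)` wrapper and no partial markers (`<`, `>`).

```python
import re

# Location string copied from the record (1-based, inclusive coordinates).
LOCATION = (
    "join(43226509..43226604,43227052..43227166,43227257..43227336,"
    "43232153..43232228,43232325..43232389,43232496..43232558,"
    "43234511..43234592,43234691..43234853,43240226..43240300,"
    "43240462..43241209)"
)


def parse_join(location):
    """Return the (start, end) pairs found in a simple join(...) location.

    Assumption: plus-strand join without complement() or partial markers.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Sum the segment lengths; both ends of each segment are inclusive."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
total_nt = spliced_length(segments)

# A complete CDS is a whole number of codons, and the last codon is the stop.
residues = total_nt // 3 - 1
print(len(segments), total_nt, residues)  # prints: 10 1563 520
```

The 1563 nt total corresponds to 521 codons, that is, 520 residues plus a stop codon, which agrees with the 520aa length reported for XP_010919474.1.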

Protein

Length 520aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268357
db_source XM_010921172.3
Definition U1 small nuclear ribonucleoprotein 70 kDa [Elaeis guineensis]

EGGNOG-MAPPER Annotation

COG_category A
Description U1 small nuclear ribonucleoprotein
KEGG_TC -
KEGG_Module M00351
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko00002, ko03041
KEGG_ko ko:K11093
EC -
KEGG_Pathway ko03040, map03040
GOs -

Sequence

CDS:  
ATGGGAGACTACGGCGATGCCATGATGCGCAATAACGCCGCCGTCCAGGCGCGCACGAAAGCCCAGAACCGAGCTAACGTGCTTCAGCTCAAGCTGATTGGGCAGAGTCATCCTACTGGCCTTACCGCCAATCTTTTGAAGCTTTTTGAGCCCCGACCTCCTTTGGAGTATAAACCTCCTATTGAGAAGAGGAAATGCCCCTCATATACAGGGATGGCACAATTTGTGAGTCATTTTGCCGAGCCTGGGGATCCTGAATATGCTCCACCCGTCGAAAAGGGTGAAACCCCCGCACAAAGAAGAGCTAGAATCCACAAGCTTCGGCTGGATGAAGGTGCAAGAAAAGCTGCTGAAGAGCTCGAGAAATATGATCCACATAAAGACCCTAATATAACTGGGGATCCATACAAGACACTGTTTGTGGCAAGGCTTAACTTCGAGACTACTGAGCACAGGATCAAAAGGGAGTTTGAAGCTTATGGGCCAATCAAACGGGTCCGGCTGATTACTGATAAGGTGACAAATAAGCCTAGAGGATATGCCTTCATCGAATATATGCATACTCGAGATATGAAAACTGCTTACAAGCAAGCTGATGGGAGGAAAGTGGATAATAAAAGAGTACTTGTGGATGTTGAGCGTGGTAGAACTGTTCCAAATTGGCGACCTCGAAGATTGGGTGGAGGACTTGGATCAACCAGGATAGGAGGTGAAGAGGTTAATCAGAAGTATTCTGGCAGGGAGCAACAGCAAGTTGCATCTGGACGTCCTAGATCAGAAGAGCCTAGGTCCAGGGATGACCGCCTTTTGGATCGGGAGAAGTCTCGAGAAAGAGGAAGGGAACGTGAGCGAGAGAGGTCACGTGAACGGTCCTATGACAGGACACGGGATCGTGATACCAGAGAAGAAAGGCACCACCACAGAGATCGGGATAGGAATAGGGACAGGGACAGAGACAGGGAAAGAAATCGTGGGCGTGACCGTGATCGGGCCAGTGACAGAGATAGGGAGAGAGACCGTGGCCGTGACTATGACCGAGATCGGGAGCGTGAACGTGATCGTGACCGTCCTCGTGAGAGGGAGCGGGAGAGGGAACGTGACAGAGATTATGACCGGGCAAGTCATGAAAGAGGCCGTGGGCATACACATGAGAGGGATGCTCACTATGATTATATTGAGCCAAAGCATGATAGGGAGATGCCTGGGATGAATGTGAGAGACTTTGATTATGGAGAATCTAATCATGGAAGAGAGTGGTATGATGGGCCTAAGCATGGGCAGGAACATGATTATTATCGGTATGAACAACAGAGAAATCAGGAGCAATATGATTATCAAGAACACCATGGTCTCGGCGATCCTCAGCATGATTTGGAGCATCCTAGGCGACATGATCATGAATACTATGACCATGCCCCCTATGATAAGGTGGATCCTGTCAATTATCACAGTGAATTTAATCGTGCAGGATCTGAATCACGTGAGGAGGGTGAGGCATTCGGTGACCAGGATTATGAGCATCATCGCTCAGAGAGATCACTTTCCCATGAATATGAAAACTGA
Protein:  
MGDYGDAMMRNNAAVQARTKAQNRANVLQLKLIGQSHPTGLTANLLKLFEPRPPLEYKPPIEKRKCPSYTGMAQFVSHFAEPGDPEYAPPVEKGETPAQRRARIHKLRLDEGARKAAEELEKYDPHKDPNITGDPYKTLFVARLNFETTEHRIKREFEAYGPIKRVRLITDKVTNKPRGYAFIEYMHTRDMKTAYKQADGRKVDNKRVLVDVERGRTVPNWRPRRLGGGLGSTRIGGEEVNQKYSGREQQQVASGRPRSEEPRSRDDRLLDREKSRERGRERERERSRERSYDRTRDRDTREERHHHRDRDRNRDRDRDRERNRGRDRDRASDRDRERDRGRDYDRDRERERDRDRPRERERERERDRDYDRASHERGRGHTHERDAHYDYIEPKHDREMPGMNVRDFDYGESNHGREWYDGPKHGQEHDYYRYEQQRNQEQYDYQEHHGLGDPQHDLEHPRRHDHEYYDHAPYDKVDPVNYHSEFNRAGSESREEGEAFGDQDYEHHRSERSLSHEYEN
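
The CDS and Protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not part of the original record, that assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a nuclear gene. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 21 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing non-ACGT symbols are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nt and last 18 nt of the CDS given in the record.
cds_start = "ATGGGAGACTACGGCGATGCC"
cds_end = "CATGAATATGAAAACTGA"

print(translate(cds_start))  # prints: MGDYGDA
print(translate(cds_end))    # prints: HEYEN
```

Both fragments match the corresponding ends of the Protein sequence (`MGDYGDA...` and `...HEYEN`). Passing the full CDS string to `translate` would allow the same comparison over the whole sequence.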